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RAW SEQUENCE LISTING 
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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /p/?fe^ I'^jQ 

Source: //^o^Q/ 

Date Processed by STIC: y//^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO-REDUCE ERRORED SEQUENCE LISTINGS- FLEASE USE THE CHECKER 
VERSION 4.2 PROGRAM ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1 EFS-Bio (<httn://www.usnto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box -1450, Alexandria, VA 22313-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 06/05/04): 

U.S. Patent and Trademark Office, 220 20 th Street S., Customer Window, Mail Stop Sequence Crystal Plaza Two Lc 
Room 1B03, Arlington, VA 22202 



Revised 05/17/04 



Raw Sequence Listing Error Summary 



ERROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 

Misaligned Amino The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 

Non-ASCII The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 

ensure your subsequent submission is saved in ASCII text. 

Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 



Patentln 2.0 A "bug" in Patentln version 2.0 has caused. the <220>-<223> section to be missing from amino acid 

"bug" sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Sequence(s) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



Skipped Sequences Sequence(s) 
(NEW RULES) <2 1 0> sequence id number 

<400> sequence id number 

000 



10 



11 



12 



_Use of n's or Xaa's 
(NEW RULES) 



Jnvalid <213> 
Response 



Useof<220> 



_PatentIn 2.0 
"bug" 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species), <220>-<223> section is required when <213> response is Unknown cr 
is Artificial Sequence 

Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses. 

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



. Misuse of n/Xaa "n" can only represent a single nucleotide ; "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 



Page 1 ( 




RAW SEQUENCE LISTING DATE: 09/01/2004 

PATENT APPLICATION: US/10/765 , 12 0 TIME: 12:45:36 

Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 

1 <110> APPLICANT: Benner, Steven Albert 
W--> 2 <120> TITLE OF INVENTION: Evolution-Based Functional Genomics 
W--> 3 <140> CURRENT APPLICATION NUMBER: 10/765,120 
W--> 0 <13 0> FILE REFERENCE: 

4 <141> CURRENT FILING DATE: 2004-01-28 
W--> 5 <160> NUMBER OF SEQ ID: 38 

6 <170> SOFTWARE: Macintosh OS 10.3 Microsoft Word v. 2003 



ERRORED SEQUENCES 




E 



938 <210> SEQ ID NO r j_14 

939 <211> LENGTH : (llj^J 

940 <212> TYPE: PRT 

941 <213> ORGANISM: Oryctolagus cuniculus 
W--> 942 <4 00> SEQUENCE: 14 * 

943 Met Leu Leu G.lu Val Leu Asn Pro Arg His Tyr Asn Val Thr Ser , {l^^iji 
> 944 ^005 f ^ ic c*fr \€ ^.On^J^fj^ v , 

945 Met Val Ser Glu Val Val Pro He Ala Ser He Ala He Leu Leu /} A ^JL^<t C^<^A 

946 ^20 ^25 ^30 

94 7 Leu Thr Gly Phe Leu Leu Leu Val Trp Asn Tyr Glu Asp Thr Ser j\JUUi^^^ 

948 £35 (/40 ~ ~ ^45 ' 7 

949 Ser He Pro Gly Pro Ser Tyr Phe Leu Gly He Gly Pro Leu He / aCj} aJ^'*^ 

950 /)50 655 ' d)60 l/^^^ 

951 Ser His Cys Arg Phe Leu Trp Met Gly He Gly Ser Ala Cys Asn yy^ £>LU'V 

952 ^65 A70 975 U ) 

953 Tyr Tyr Asn Lys Met Tyr Gly Glu Phe Met Arg Val Trp Val Cys V, , ^y<^~\ 

954 ^80 $85 (J90 /X^ 1 ^ . ' 

955 Gly Glu Glu Thr Leu He He Ser Lys Ser Ser Ser Met Phe His 

956 ^95 100 105 

957 Val Met Lys His Ser His Tyr He Ser Arg Phe Gly Ser Lys Leu 

958 110 115 ~ 120 

959 Gly Leu Gin Phe He Gly Met His Glu Lys Gly He He Phe Asn 

960 125 130 135 

961 Asn Asn Pro Ala Leu Trp Lys Ala Val Arg Pro Phe Phe Thr Lys 1 , / 

962 140 145 150 m'S [Al^^j 

963 Ala Leu Ser Gly Pro Gly Leu Val Arg Met Val Thr He Cys Ala ' 0 j I 

964 155 160 165 /*Lki 

965 Asp Ser He Thr Lys His Leu Asp Arg Leu Glu Glu Val Cys Asn Wr 

966 170 175 180 , ^) /U^/ 

967 Asp Leu Gly Tyr Val Asp Val Leu Thr Leu Met Arg Arg He Met (J^-rr 1 ^^ 

968 185 190 ~ 195 /, 
96 9 Leu Asp Thr Ser Asn Met Leu Phe Leu Gly He Pro Leu Asp Glu / ytiJJL A^' rh f' 

file://C:\CRF4\Outhold\VsrJ765 120.htm sfajf***** ' L^tylZjJjc) 9/1/04 
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RAW SEQUENCE LISTING DATE: 09/01/2004 

PATENT APPLICATION: US/10/7 65 , 12 0 TIME: 12:45:36 

Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 

970 200 205 210 

971 Ser Ala He Val Val Asn He Gin Gly Tyr Phe Asp Ala Trp Gin 

972 215 220 225 

973 Ala Leu Leu Leu Lys Pro Asp He Phe Phe Lys He Ser Trp Leu 

974 230 " 235 240 
9 75 Cys Arg Lys Tyr Glu Lys Ser Val Lys Asp Leu Lys Asp Ala Met 

976 245 250 ' 255 

977 Glu lie Leu He Ala Glu Lys Arg His Arg He Ser Thr Ala Glu 

978 260 265 270 

979 Lys Leu Glu Asp Ser He Asp Phe Ala Thr Glu Leu He Phe Ala 

980 275 280 285 

981 Glu Lys Arg Gly Glu Leu Thr Arg Glu Asn Val Asn Gin Cys He 

982 290 295 300 

983 Leu Glu Met Leu He Ala Ala Pro Asp Thr Met Ser Val Ser Val 

984 305 310 315 

985 Phe Phe Met Leu Phe Leu He Ala Lys His Pro Gin Val Glu Glu 

986 320 325 330 

987 Ala He He Arg Glu He Gin Thr Val Val Gly Glu Arg Asp He 

988 335 340 " 345 
9 89 Arg He Asp Asp Met Gin Lys Leu Lys Val Val Glu Asn Phe He 
990 350 355 360 
9 91 Asn Glu Ser Met Arg Tyr Gin Pro Val Val Asp Leu Val Met Arg 
992 365 370 375 
9 93 Lys Ala Leu Glu Asp Asp Val He Asp Gly Tyr Pro Val Lys Lys 

994 380 385 ' 390 

995 Gly Thr Asn He He Leu Asn Leu Gly Arg Met His Arg Leu Glu 

996 395 400 405 
9 97 Phe Phe Pro Lys Pro Asn Glu Phe Thr Leu Glu Asn Phe Ala Lys 
998 410 415 420 
9 99 Asn Val Pro Tyr Arg Tyr Phe Gin Pro Phe Gly Phe Gly Pro Arg 

1000 425 430 435 

1001 Gly Cys Ala Gly Lys Tyr He Ala Met Val Met Met Lys Val Val 

1002 440 445 450 

1003 Leu Val Thr Leu Leu Arg Arg Phe His Val Gin Thr Leu Gin Gly 

1004 455 460 465 

1005 Arg Cys Val Glu Lys Met Gin Lys Lys Asn Asp Leu Ser Leu His 

1006 470 ^ 475 480 
100 7 Pro Asp Glu Thr Arg Asp <<^— * 

E- - > 1008 485 

1151 <210> SEQ ID NO: 17 

1152 <211> LENGTH: 486 

1153 <212> TYPE: PRT 

1154 <213> ORGANISM: Poephila guttata 
W--> 1155 <400> SEQUENCE : 17 

1156 Met Phe Leu Glu Met Leu Asn Pro Met His Tyr Asn Val Thr He 

1157 >&05 £-10 .£15 

1158 Met Val Pro Glu Thr Val Pro Val Ser Ala Met Pro Leu Leu Leu 

1159 <920 -e-2 5 n^o 

1160 He Met Gly Leu Leu Leu Leu He Arg Asn Cys Glu Ser Ser Ser 




file://C:\CRF4\Outho!d\VsrJ765 1 20.htm 
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RAW SEQUENCE LISTING DATE: 09/01/2004 

PATENT APPLICATION: US/10/765 , 12 0 TIME: 12:45:36 

Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 

H61 ^5 fifto .#45 

1162 Ser He Pro Gly Pro Gly Tyr Cys Leu Gly He Gly Pro Leu He 

1163 ..#55 .^ereo 

1164 Ser His Gly Arg Phe Leu Trp Met Gly He Gly Ser Ala Cys Asn 
H65 .^65 ...tfVO J075 
1166 Tyr Tyr Asn Lys Met Tyr Gly Glu Phe Met Arg Val Trp He Ser 
11^7 J0 8O &S5 J 090 
1168 Gly Glu Glu Thr Leu He He Ser Lys Ser Ser Ser Met Val His 

ytf95 100 105 

1170 Val Met Lys His Ser Asn Tyr He Ser Arg Phe Gly Ser Lys Arg 

1171 110 115 ^ 120 

1172 Gly Leu Gin Cys He Gly Met His Glu Asn Gly He He Phe Asn 
H73 125 130 ! 135 

1174 Asn Asn Pro Ser Leu Trp Arg Thr Val Arg Pro Phe Phe Met Lys 

1175 140 145 150 

1176 Ala Leu Thr Gly Pro Gly Leu He Arg Met Val Glu Val Cys Val 
H77 155 160 165 
1178 Glu Ser He Lys Gin His Leu Asp Arg Leu Gly Asp Val Thr Asp 
H79 170 ~ 175 ~ J 180 
1180 Asn Ser Gly Tyr Val Asp Val Val Thr Leu Met Arg His He Met 
H81 185 190 ~ 195 

1182 Leu Asp Thr Ser Asn Thr Leu Phe Leu Gly He Pro Leu Asp Glu / // n a J 

e--> 1183 zoo - zcS ^/o £r /T^^r^f^ '/ 

1184 Ser Ser He Val Lys Lys He Gin Gly Tyr Phe Asn Ala Trp Gin /y ^JL^f 

■"> H85 215 220 225 6^*9++*+ 

1186 Ala Leu Leu He Lys Pro Asn He Phe Phe Lys He Ser Trp Leu ^WH^^ 
E "-> 1187 230 235 240 

118 8 Tyr Arg Lys Tyr Glu Arg Ser Val Lys Asp Leu Lys Asp Glu He 
E--> 1189 245 250 255 

1190 Glu He Leu Val Glu Lys Lys Arg Gin Lys Val Ser Ser Ala Glu 
E "-> 1191 260 265 270 

1192 Lys Leu Glu Asp Cys Met Asp Phe Ala Thr Asp Leu He Phe Ala 
E "> H93 275 280 " 285 

1194 Glu Arg Arg Gly Asp Leu Thr Lys Glu Asn Val Asn Gin Cys He 
E--> 1195 290 295 300 

1196 Leu Glu Met Leu He Ala Ala Pro Asp Thr Met Ser Val Thr Leu 
E--> 1197 305 310 315 

1198 Tyr Val Met Leu Leu Leu He Ala Glu Tyr Pro Glu Val Glu Thr 
E--> 1199 320 325 330 

1200 Ala He Leu Lys Glu He His Thr Val Val Gly Asp Arg Asp He 
E~-> 1201 335 340 345 

12 02 Arg He Gly Asp Val Gin Asn Leu Lys Val Val Glu Asn Phe He 
E — > 1203 350 355 360 

12 04 Asn Glu Ser Leu Arg Tyr Gin Pro Val Val Asp Leu Val Met Arg 
E--> 1205 365 370 375 

12 06 Arg Ala Leu Glu Asp Asp Val He Asp Gly Tyr Pro Val Lys Lys 
E--> 1207 380 385 390 

1208 Gly Thr Asn He He Leu Asn He Gly Arg Met His Arg Leu Glu 
E--> 1209 395 400 405 
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RAW SEQUENCE LISTING DATE : 09/01/2004 

PATENT APPLICATION: US/10/765 , 120 TIME: 12:45:36 

Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 





1210 


Tyr 


Phe 


Pro Lys Pro 


Asn 


Glu 


Phe 


Thr 


Leu 


Glu 


Asn 


Phe 


Glu 


Lys 


E--> 


1211 






410 










415 










420 




1212 


Asn 


Val 


Pro Tyr Arg 


Tyr 


Phe 


Gin 


Pro 


Phe 


Gly 


Phe 


Gly 


Pro 


Arg 


E--> 


1213 






425 










430 








435 




1214 


Ser 


Cys 


Ala Gly Lys 


Tyr 


He 


Ala 


Met 


Val 


Met 


Met 


Lys 


Val 


Val 


E--> 


1215 






440 










445 








450 




1216 


Leu 


Val 


Thr Leu Leu 


Lys 


Arg 


Phe 


His 


Val 


Lys 


Thr 


Leu 


Gin 


Lys 


E--> 


1217 






455 










460 










465 




1218 


Arg 


Cys 


lie Glu Asn 


Met 


Pro 


Lys 


Asn 


Asn 


Asp 


Leu 


Ser 


Leu 


His 


E--> 


1219 






470 










475 








480 




1220 


Leu 


Asp 


Glu Asp Ser 


Pro 


















E--> 


1221 






485 























1445 <210> SEQ ID NO^_35 

1446 <211> LENGTH: 

1447 <212> TYPE; 

1448 <213> ORGANISM: Sus scrofa 
W--> 1449 <400> SEQUENCE: 35 

C--> 1450 caa^attac acgtgccgat ttggcagcaa acttgggttg gaatgcattg gcatgcatga 60 \ 
E--> 1451^aaggcatcaJ_tgtttaacaa taa x - a if ~ on A- L'h^J^f 

1454 <211> LENGTH: 84 J O^rH^^-^^ 

1455 <212> TYPE: DNA (tilHA *Xc^ 

1456 <213> ORGANISM: Sus scrofa j ' ' SjL 
W--> 1457 <4 00 > SEQUENCE: 36 W /^(l 
C--> 1458 tagtcactac acatcccgat ttggcagcaa acctgggttg cagttcattg gcatgcatga 60 /jy^^A^ 
E--> 1459 gaaaggcatt atattcaaca ataa q LL * t f) 

1461 <210> SEQ ID NO: 37 ' ' * VJ JLfr<J^ 

1462 <211> LENGTH: 84 / $ l2* 

1463 <212> TYPE: DNA 1 ^ 

1464 <213> ORGANISM: Sus scrofa 
W--> 1465 <400> SEQUENCE: 37 

C--> 1466 cagtcactac acatcccgat tcggcagcaa acctgggttg gagtgcatcg gcatgtatga 60 ' , 

E--> 1467 gaagggcatc atatttaata atga | i ' cjii /— p^ASiJf" 

> 1469 <210> SEQ ID NO: (3T)Z9 CA&H^ +0 °~ 



E--> 1469 <210> SEQ ID NO: (37j ^ (Jr\(X\>^ 

1470 <211> LENGTH: 84^^ 

1471 <212> TYPE: DNA 

1472 <213> ORGANISM: White lipped peccary 
W--> 1473 <400> SEQUENCE: (H) <£- J^V^ TO 

C--> 1474 cagtcactac acatcccgat tcggcagcaa acctgggttg cagttcattg gaatgcatga 60 / 
E--> 1475 gaaaggcatc atatttaaca acaa " ^If-^/^^^T 



file://C:\CRF4\Outhold\ VsrJ765120.htm 



9/1/04 
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<210> 26 

<211> 50 

<212> 

<213> <^ncestral sequence 
<4 00> 26 — - — - 

Arg Asn Val He Gin He Ser Asn Asp Leu Glu Asn Leu Arg Asp 

^5 ,£10 .^15 

Leu Leu His Leu Leu Ala Ser Ser Lys Ser Cys Pro Leu Pro Gin 

-020 j>2 5 G-3 0 

Ala Arg Gly Leu Glu Thr Leu Glu Ser Leu Gly Gly Val Leu Glu 

£-35 JMO " .^45 

Ala Ser Leu Tyr Ser 

^050 



PCIPI/EXEC/22/6 
Annex VII, page 29 




<110> Smith, John; Smithgene Inc. 

<120> Example of a Sequence Listing 

<130> 01-00001 



<140> 
<141> 



PCT/EP98/00001 
1998-12-31 



<150> 
<151> 



US 08/999,999 
1997-10-15 



<160> 



<170> 



Patentln version 2.0 



<210> 
<211> 
<212> 
<213> 



1 

389 
DNA 

Paramecium sp. 



<220> 
<221> 
<222> 

<300> 
<301> 

<302> 

<303> 
<304> 
<305> 
<306> 
<307> 
<308> 
<309> 



CDS 

(279) .. . (389) 



Doe, Richard 

Isolation and Characterization of a Gene Encoding a 

Protease from Paramecium sp . 

Journal of Genes 

1 

4 

1-7 

1988-06-31 



123456 
1988-06-31 



<400> 1 

agctgtagtc attcctgtgt cctcttctct ctgggcttct caccctgcta atcagatctc 

agggagagtg tcttgaccct cctctgcctt tgcagcttca caggcaggca ggcaggcagc 

tgatgtggca attgctggca gtgccacagg cttttcagcc aggcttaggg tgggttccgc 



60 
120 
180 



cgcggcgcgg cggcccctct 



cgcgctcctc tcgcgcctct ctctcgctct cctctcgctc 



240 



PCIPI/EXEC/22/6 
Annex VII, page 30 
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ggacctgatt aggtgagcag gaggaggggg cagttagc atg gtt tea atg ttc age 296 

Met Val Ser 
1 

ttg tct ttc aaa tgg cct gga ttt tgt ttg ttt gtt tgt ttg ttc caa 344 

Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu Phe Val Cys 
10 15 



atg 


ttc 


age 


Met 


Phe 


Ser 




5 




ttg 


ttc 


caa 


Leu 


Phe 


Gin 


20 







tgt ccc aaa gtc etc ccc tgt cac tea tea ctg cag ccg aat ctt ^ 389 
Cys Pro Lys Val Leu Pro Cys His Ser Ser Leu Gin Pro Asn Leu 
25 30 35 



<210> 2 

<211> 37 

<212> PRT 

<213> Paramecium sp. 

<400> 2 

Met Val Ser Met Phe Ser Leu Ser Phe Lys Trp Pro Gly Phe Cys Leu 

1 5 1-0 15 



Phe Val Cys 



Leu 
20 



Phe Gin Cys Pro 



Lys 

25 



Val Leu Pro Cys 



His 
30 



Ser Ser 



Leu Gin 



Pro 
35 



Asn Leu 



<210> 
<211> 
<212> 
<213> 



3 

11 
PRT 

Artificial Sequence 



<220> 

<223> Designed peptide based on size and polarity to act as a 

linker between the alpha and beta chains ot Protein XYZ . 

<400> 3 

Met Val Asn Leu Glu Pro Met His Thr Glu lie 
1 5 10 



<210> 4 
<400> 4 
000 



[Annex VIII follows] 



VERIFICATION SUMMARY DATE: 09/01/2004 

PATENT APPLICATION: US/10/7 65,12 0 TIME : 12:45:37 



Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 



L: 
L: 
■L: 
L: 
L: 
L: 
L: 
L: 
L: 
L: 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L: 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 
L 



2 M:283 

3 M:283 
0 M:201 

5 M:283 W: Missing Blank Line separator, <160> field identifier 
12 M:283 W: Missing Blank Line separator, <400> field identifier 
84 M:283 W: Missing Blank Line separator, <400> field identifier 



W: Missing Blank Line separator, <120> field identifier 
W: Missing Blank Line separator, <140> field identifier 
W: Mandatory field data missing, <130> FILE REFERENCE 



156 
228 
300 
371 
443 
515 
585 
656 
728 
800 
872 
: 942 
: 944 
:1008 
:1014 
:1086 
:1155 
: 1183 
:1185 
: 1187 
:1189 



Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 

M:283 W: Missing Blank Line separator, <400> 

M:283 W: Missing Blank Line separator, <400> 

Missing Blank Line separator, <400> 
Missing Blank Line separator, 



M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 
M:283 



W: 
W: 
W: 
W: 
W: 
W: 
W: 
W: 



M:283 W: 
M:283 W: 
M:332 E: 

M:252 E 

M:283 

M:283 

M:283 

M:332 

M:332 

M:332 

M:332 



field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 
field identifier 



1191 M:332 
1193 M:332 



1195 
1197 
:1199 
:1201 
:1203 
:1205 
: 1207 
:1209 
:1211 
:1213 
:1215 
: 1217 
:1219 
:1221 
:1227 
:1241 
:1255 
:1269 
:1283 



M:332 
M:332 
M:332 
M: 332 
M:332 
M:332 
M:332 
M: 332 
M:332 
M:332 
M: 332 
M:332 
M:332 
M:332 
M:283 
M:283 
M:283 
M:283 
M:283 



W: 
W: 
W: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
E: 
W: 
W: 
W: 
W: 
W: 



<400> 

(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 14 
No. of Seq. differs, <211> LENGTH : Input : 4 78 Found:486 SEQ : 14 
Missing Blank Line separator, <400> field identifier 
Missing Blank Line separator, <400> field identifier 
Missing Blank Line separator, <400> field identifier 



(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 
(32 



ID: 17 
ID:17 
ID: 17 
ID: 17 
ID:17 



Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, 
Invalid/Missing Amino Acid Numbering, 
Invalid/Missing Amino Acid Numbering, 
Invalid/Missing Amino Acid Numbering, 
Invalid/Missing Amino Acid Numbering, 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, SEQ ID: 17 
Invalid/Missing Amino Acid Numbering, 
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Missing Blank Line separator, 
Missing Blank Line separator, 
Missing Blank Line separator, 



<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 
<400> field identifier 



file://C:\CRF4\Outhold\VsrJ765120.htm 



9/1/04 



VERIFICATION SUMMARY DATE : 09/01/2004 

PATENT APPLICATION: US/10/765 , 12 0 TIME: 12:45:37 

Input Set : A:\pto.lm.txt 

Output Set: N:\CRF4\09012004\J765120.raw 

L:1297 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1311 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1325 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1339 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1353 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1367 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1381 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1395 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1409 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1419 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1429 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1439 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1449 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1450 M:112 C: (48) String data converted to lower case, 

L:1451 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:84 SEQ:35 
L:1451 M:112 C: (48) String data converted to lower case, 

L:1451 M:252 E: No. of Seq . differs, <211> LENGTH : Input : 83 Found:84 SEQ:35 
L:1457 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1458 M:112 C: (48) String data converted to lower case, 

L:1459 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 84 SEQ : 36 ^ 

L:1459 M:112 C: (48) String data converted to lower case, 

L:1465 M:283 W: Missing Blank Line separator, <400> field identifier 

L:1466 M:112 C: (48) String data converted to lower case, 

L:1467 M.-254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:84 SEQ:37 

L:1467 M:112 C: (48) String data converted to lower case, 

L:1469 M:212 E: (34) Invalid or duplicate Sequence ID Number, SEQ ID NO:37 
L:1473 M:283 W: Missing Blank Line separator, <400> field identifier 
L:1474 M:112 C: (48) String data converted to lower case, 
L:1475 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:84 SEQ:37 
L:1475 M:112 C: (48) String data converted to lower case, 
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